Metrics for vector quantization-based parametric speech enhancement and separation.
Abstract
Speech enhancement and separation algorithms sometimes employ a two-stage processing scheme: the signal is first mapped to an intermediate low-dimensional parametric description, and the parameters are then mapped, using a vector quantizer, to vectors in codebooks trained on, for example, individual noise-free sources. To obtain accurate parameters, one must use a good estimator, such as a maximum likelihood estimator, to find the parameters of the intermediate representation. This leaves open, however, which metrics to use in the subsequent vector quantization process and how to derive them systematically. This paper aims to answer these questions. Metrics for this purpose are presented and derived, and their use is exemplified on a number of different signal models by deriving closed-form expressions. In essence, the metrics account, in the vector quantization process, for the fact that some parameters may have been estimated more accurately than others and that there may be dependencies between the estimation errors.
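To make the last point of the abstract concrete, here is a minimal sketch. It assumes the metric is a quadratic (Mahalanobis-type) distance weighted by the inverse covariance of the estimation errors. That form, the function name `weighted_vq`, and the toy numbers are illustrative assumptions, not the closed-form expressions derived in the paper.

```python
import numpy as np

def weighted_vq(theta_hat, codebook, error_cov):
    """Pick the codebook entry c minimizing (theta_hat - c)^T W (theta_hat - c).

    W is taken as the inverse of the (assumed known) covariance of the
    estimation errors, so accurately estimated parameters weigh more and
    off-diagonal terms capture dependencies between errors.
    """
    weight = np.linalg.inv(error_cov)           # W = C^{-1}
    diff = codebook - theta_hat                 # shape (K, D)
    dists = np.einsum("kd,de,ke->k", diff, weight, diff)
    return int(np.argmin(dists)), dists

# Hypothetical two-entry codebook and a noisy parameter estimate.
codebook = np.array([[1.0, 0.0],
                     [0.0, 0.5]])
theta_hat = np.array([0.8, 0.6])

# Plain Euclidean metric: identity error covariance.
idx_euclid, _ = weighted_vq(theta_hat, codebook, np.eye(2))

# Assumed error covariance: the second parameter is estimated far more
# accurately (variance 0.01) than the first (variance 1.0).
error_cov = np.diag([1.0, 0.01])
idx_weighted, _ = weighted_vq(theta_hat, codebook, error_cov)

print(idx_euclid, idx_weighted)
```

In this toy case the Euclidean metric selects entry 0, while the weighted metric selects entry 1, because a mismatch in the accurately estimated second parameter is penalized more heavily. A non-diagonal `error_cov` would account for correlated estimation errors in the same way.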
Similar resources
Using FFI Interpolator and VQ Quantization for Designing of High Quality 1200 BPS Speech Vocoder
Storage or transmission of speech signals at very low bit rates is a hot area in the field of speech processing. We used stochastic inter-frame interpolators and vector quantization (VQ) as a new method for developing a high-quality 1200 BPS speech vocoder. The objective and subjective test results show that the performance of the new vocoder is comparable with that of standard 4800 BPS vocoders (such as CELP).
A Vector-Predictive Multi-Mode Matrix Quantization Approach for Parametric Speech Coding
In parametric speech coding, the accuracy of parameter quantization has a significant effect on speech quality. In this paper, we present a flexible and high-fidelity multi-mode quantization approach that combines the beneficial features of predictive vector quantization and matrix quantization. As an example, the proposed technique is employed in quantization of the power component in a wavefo...
The 2nd ‘chime’ Speech Separation and Recognition Challenge: Approaches on Single-channel Source Separation and Model-driven Speech Enhancement
In this paper, we address the small vocabulary track (track 1) of the CHiME 2 challenge, which is dedicated to recognizing utterances of a target speaker with small head movements. The utterances are recorded in reverberant room acoustics and corrupted by highly non-stationary noise sources. Such an adverse noise scenario poses a challenge to state-of-the-art automatic speech recognition systems. ...
Compression of Speech Signals using MSVQ and its Enhancement Using Spectral Subtraction & Kalman filter and its performance comparisions
Coding algorithms seek to minimize the bit rate in the digital representation of a signal without an objectionable loss of signal quality in the process. Speech enhancement means improvement in intelligibility and/or quality of a speech signal. This paper deals with multistage vector quantization technique used for coding of narrow band speech signals. The parameter used for coding of speech si...
Journal title:
- The Journal of the Acoustical Society of America
Volume 133, Issue 5
Pages: -
Publication date: 2013